CDS
Accession Number | TCMCG002C05009 |
gbkey | CDS |
Protein Id | XP_020084459.1 |
Location | complement(join(9865089..9865109,9865242..9865373,9865496..9865723,9866213..9866290,9866381..9866446,9866551..9866631,9869334..9870094,9870420..9870613,9871026..9871120,9872176..9872316,9872755..9872832,9873289..9873417,9874954..9875088,9875979..9876095,9876219..9876270,9876422..9876553,9877147..9877199,9877331..9877426,9877750..9877895,9879075..9879210,9880013..9880166,9882223..9882369,9882851..9883233)) |
Gene | LOC109707540 |
GeneID | 109707540 |
Organism | Ananas comosus |
Protein
Length | 1184aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA371634 |
db_source | XM_020228870.1 |
Definition | probable cleavage and polyadenylation specificity factor subunit 1 isoform X3 [Ananas comosus] |
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGAGCTACGCGGCCTTCAAAATGATGCATTGGCCCACGGGGATCGAGAACTGCGCCGCCGGGTTCTTCACCCACTCCCCATCCTCCTCCTCCTCCTCCTCCGCCGCCGCCGCGGAGGCGACGGCGGCCGCCGAGATCCCTCCCCTGCCCGGCGACGAACTCGAGGCGGCGGAGTGGCAGCAGGGGAGGAGGCGACGCGGGGTTGGCGCCGTTCCCGACCTCGTCGTCACGGCGGGGAACGTGCTCGAGATCTACGTAGTTAGGGCGCAGGAGGACGAGGGGAGAGCCCCTCGGGCCTCCGGCGAGCAGAAGCGCGGCGGCGGAGGCGGCGGCGTTGTCGACGGGATCTCTGGTGCCCGCCTCGAGCTCGTGTGCCACTATCGCTTGCATGGCAATGTGGAATCAATGGCAGTTTTATCTGTTGGAGCTGATAATCGCAGCAACAGAAGAGATTCTATTGTCCTAGCATTCCAAGATGCAAAAATCACCGTGTTAGAATATGATGATTCACTACATGGACTGCGGACAAGTTCCATGCACTGTTTTGAAGGGCCAGACTGGCAGTACCTGAAAAGAGGCAGGGAGTCGTTTGCCCGTGGTCCTATTGTAAAGGCCGATCCCTCAGGCCGATGTGGTGGAGCACTTGTTTATGGGCTTCAAATGATTATTCTTAAAGCTGCTCAGGCTGGGCAGAGTTTAGTTGGAGATGATGAGCCTAACAGTGCTGGAGGCACTATCTCTGTTCGTATTGAGTCATCTTATGTGATTAATTTGCGTGAATTAGACATGAATCATGTCAAAGATTTCACATTCGTACATGGTTATATTGAGCCTGTTATGGTTATACTTCATGAAAGAGAGCCTACATGGGCTGGCCGTATCTCATGGAAGCACCACACATGCATGATTTCTGCGCTTAGCGTCAGCACAACTTTGAAACAACATCCGATGATATGGTCTGCATCTAATCTTCCACATGATGCATACAAACTTCTTGCGGTGCCTTCACCAATTGGCGGTGTTCTTGTGATCTGTGCAAACTCCATACACTATCACAGTCAGTCCGCATCTTGCTCTCTGAGTCTCAACAGTTTCTCATCGCAGCCGGATGGCAGTCTTGAAATGCCTAAATCGAACTTCGCTGTGGAGCTTGATGCAGCTCATGCAACATGGTTATCACATGATGTTGCTATGTTCTCATCAAAGACTGGAGAATTACTATTGCTCACCTTGGTCTATGATGGAAGAATTGTGCAGAGACTTGATCTTGTGAAATCCAAAGCTTCAGTTTTGACCTCGGGTTTGACAACTATTGGGAGTTCATTCTTCTTCCTTGGCAGTCGCCTGGGAGACAGCCTCCTTGTGCAATATAGCTGCGGGACGTCAGTGCCAACTTCTAGCCAAGTGAAAGATGAGGCTACTGATATTGATGGTGATGTCCCTTCAGCAAAGCGATTAAGAAGGATGTCTTCAGATGCTTTACAAGATGTTACCAGTGTTGAGGAGCTGTCTTTGTATAATAATGCTCCGAACAGTTCAGAGTCAGCACAGAAGTCCTTCTCATTCGCGGTTAGAGATTCGTTAATTAATGTTGGCCCGTTGAAGGATTTTTCTTATGGTTTAAGGATCAATGCAGATCCCAATTCCACAGGACTTGCTAAGCAGAGCAACTATGAGTTGGTATGCTGTTCCGGTCATGGAAAAAATGGAGCCCTTTGTGTTCTCCAACAATCAATTCGCCCTGAGCTGATTACCGAGGTGGTGTTAGCTGGCTGCAAGGGGATATGGACTGTATACCATAAAAGCTCACGCGGTCATGCAACTGATTCTTCTAAAACAATGACAGAGAATGATGAATATCATGCATATCTTATAATAAGCCTGGAGAGTCGTACAATGGTTCTTGAGACAGCTGATGATTTGGGAGAGGTTACTGAAACTGTTGATTATTATGTACATGGAAGTACAATTGCTGCAGGTAACTTATTTGGAAGGAGACGAGTTATTCAGATATACGCAAAGGGTGCACGTATACTAGATGGTTCTTACATGACCCAGGAATTGAATTTTGTTGCACATAATTCTGAGCAAACGTCTAGCGAATTGCCGACAGTGGCGTCTGTTTCTATAGCTGATCCTTATGTGTTATTGAAAATGACTGATGGAAGCATTCAATTGCTCCTTGGAGATCCTGCCGCTTGCACTGTTTCTCTTAATGCCCCTGCTATATTTTCAAGCTCAACAGAACCAATATCTGCATGTACACTTTATCATGATAAAGGACCGGAACCGTGGCTTCGAAAGACGAGCACTGATGCATGGCTTTCTACTGGTGTTGCCGAGCCAATTGATGGGAATGATGGATCATATCATGACCACGGTGACATATATTGTTTAGTTTGTTATGAAAATGGCAAACTTGAAATTTTCGATGTGCCTAGCTTTAAAAGTGTTTACTCTGTGGACAACTTTGTTTCTGGAAAGACTTATCTGGCGGATACATATACTAAAGACCCCAACAAATATCCTGATACTAAAGGCTATCTAAATAAGGAACCAGTGCAGAATATGAGAGTGGTCGAGCTGGCTATGCAAAGGTGGTCTGGCCGATACAGTCGTCCTTTTCTTTTTGGAATGCTGAGTGATGGGACAATTCTTTGCTACCATGCTTACTTTTACGAGGGTACAGAAAATGCTGTAAAAGGTGGAGATCCAGTTTCCCCTCGTGGTTCTGCGGACACAAGTAGTATGAGCATTTCAAGGCTGAGAAATTTGAGATTTCTTCGTGTTTCTATTGATATCACTACACGAGAAGAAATGTTGAATGCTGTGAGTAGGCCAAGAATTACTGTATTTAATAATGTGGGAGGCTATCAAGGTTTGTTTCTTAGTGGTTCAAGGCCAGCATGGCTCATGGTCTGCAGAGAACGGATTCGGGTACATCCGCAGCTATGCGATGGTTCCATAGCAGCTTTTGCTGTTCTTCACAATGTAAATTGCAATCATGGTCTTATATATGTTACATCACAGGGTTACCTAAAAATTTGTCAGCTGCCGTCGTCATTTAACTATGACAACCACTGGCCAGTTCAAAAGATTCCTTTGCTGGGCACTCCACATCAAGTCACCTATTATGCCGAAAAGAATCTATATCCACTTATTTTATCTGTTCCTGTTATCCGTCCTTTAAATCAGGTCCTTTCATCTCTGTTGGATCAAGAAATGAGCCAGCAGATAGATAACGACAACTTCAATTCTGATGATCTGCAAAAGACTTATAGTGTTGATGAATTTGAGGTCCGAATATTGGAGCCAGATAAATCTGGTCACTGGGATACTAAGGCTACTGTTCCAATGCAGACCTCTGAAAATGCCCTTACAGTCCGCATTGTTACGTTATTTAATACAACAACAAAAGAGAATGAATCTCTCATGGCCATTGGCACTGCTTATGTGCAAGGAGAGGATGTAGCTGCTCGTGGACGAGTGCTTCTGTTTTCTTTTGCCAAAACTAATGAGAGCTCCCAAAATCTGAAGTCTACTCTAAGGAGCTGA |
Protein: MSYAAFKMMHWPTGIENCAAGFFTHSPSSSSSSSAAAAEATAAAEIPPLPGDELEAAEWQQGRRRRGVGAVPDLVVTAGNVLEIYVVRAQEDEGRAPRASGEQKRGGGGGGVVDGISGARLELVCHYRLHGNVESMAVLSVGADNRSNRRDSIVLAFQDAKITVLEYDDSLHGLRTSSMHCFEGPDWQYLKRGRESFARGPIVKADPSGRCGGALVYGLQMIILKAAQAGQSLVGDDEPNSAGGTISVRIESSYVINLRELDMNHVKDFTFVHGYIEPVMVILHEREPTWAGRISWKHHTCMISALSVSTTLKQHPMIWSASNLPHDAYKLLAVPSPIGGVLVICANSIHYHSQSASCSLSLNSFSSQPDGSLEMPKSNFAVELDAAHATWLSHDVAMFSSKTGELLLLTLVYDGRIVQRLDLVKSKASVLTSGLTTIGSSFFFLGSRLGDSLLVQYSCGTSVPTSSQVKDEATDIDGDVPSAKRLRRMSSDALQDVTSVEELSLYNNAPNSSESAQKSFSFAVRDSLINVGPLKDFSYGLRINADPNSTGLAKQSNYELVCCSGHGKNGALCVLQQSIRPELITEVVLAGCKGIWTVYHKSSRGHATDSSKTMTENDEYHAYLIISLESRTMVLETADDLGEVTETVDYYVHGSTIAAGNLFGRRRVIQIYAKGARILDGSYMTQELNFVAHNSEQTSSELPTVASVSIADPYVLLKMTDGSIQLLLGDPAACTVSLNAPAIFSSSTEPISACTLYHDKGPEPWLRKTSTDAWLSTGVAEPIDGNDGSYHDHGDIYCLVCYENGKLEIFDVPSFKSVYSVDNFVSGKTYLADTYTKDPNKYPDTKGYLNKEPVQNMRVVELAMQRWSGRYSRPFLFGMLSDGTILCYHAYFYEGTENAVKGGDPVSPRGSADTSSMSISRLRNLRFLRVSIDITTREEMLNAVSRPRITVFNNVGGYQGLFLSGSRPAWLMVCRERIRVHPQLCDGSIAAFAVLHNVNCNHGLIYVTSQGYLKICQLPSSFNYDNHWPVQKIPLLGTPHQVTYYAEKNLYPLILSVPVIRPLNQVLSSLLDQEMSQQIDNDNFNSDDLQKTYSVDEFEVRILEPDKSGHWDTKATVPMQTSENALTVRIVTLFNTTTKENESLMAIGTAYVQGEDVAARGRVLLFSFAKTNESSQNLKSTLRS |